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Abstract 

We give a description of the minimal primes of the ideal generated by the 
2x2 adjacent minors of a generic matrix. We also compute the complete prime 
decomposition of the ideal of adjacent mxm minors of an m x n generic matrix 
when the characteristic of the ground field is zero. A key intermediate result 
is the proof that the ideals which appear as minimal primes are, in fact, prime 
ideals. This introduces a large new class of mixed determinantal ideals that are 
prime. 

1 Introduction 

Let X mn be an m x n matrix of indeterminates Xij which generate the polynomial 
ring K[xij] where K is a field. The ideal generated by all k x k minors of X mn has 
been studied from many different points of view; for a comprehensive exposition see 
|3j and Chapter 7] . For example, these ideals are prime ideals that are also Cohen- 
Macaulay jTHI, and they are Gorenstein when m = n JE]. Similar determinantal 
ideals where one mixes minors of different sizes have been also studied. For instance, 
in the context of invariant theory and algebras with straightening laws one looks at 
the ideal of minors generated by a coideal in a particular poset of all minors 
There are also many variations such as ladder determinantal ideals pj, and mixed 
ladder determinantal ideals jH] where the ideals of (mixed) minors in a ladder-shape 
region in X mn are studied. In both cases these ideals are prime and Cohen-Macaulay, 
and criteria for when they are Gorenstein are characterized. 

A kx k adjacent minor of X mn is the determinant of a submatrix with row indices 
ri, . . . , Tfc and column indices ci, . . . , c*, where these indices are consecutive integers. 
We let Imn{k) be the ideal generated by all of the k x k adjacent minors of X mn . As 
opposed to the ideal of all k x k minors, the ideal I m n{k) is far from being a prime 
ideal. This ideal first appeared in [7j for the case k = 2 where primary decompo- 
sitions of /2n(2) and J 44 (2) were given. The motivation for studying I mn (2) comes 

*Partially supported by the National Science Foundation (DMS-0200729) 
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from the rapidly growing field of algebraic statistics ^3], [13 Chapter 8]: a primary 
decomposition of I mn (2) helps to measure the connectedness of the set of m x n 
contingency tables with the same row and column sums via the moves corresponding 
to the 2x2 adjacent minors [7j. 

The goal of this paper is to study the minimal primes of I mn (k). A motivation is 
related to algebraic statistics and focuses on the case when k = 2 in Section 2, and 
on the case of adjacent minors of higher- dimensional matrices in Section 5. We give 
in Section 2 a combinatorial description of the minimal primes of I mn (2). This ideal 
is a very special instance of a lattice basis ideal, and minimal primes of lattice basis 
ideals have been characterized [TT]. However, in the case we treat here we get a more 
transparent characterization. 

In Section 3 we analyze the case when k = m, i.e. the maximal adjacent minors of 
an m x n matrix where m < n. In this case, I mn (Tn) is a complete intersection that is 
also radical. We present a combinatorial description of the minimal primes and give 
a recurrence relation for the number of these primes. These prime ideals are a very 
general type of mixed determinantal ideals that, to our knowledge, have never before 
been studied. All the usual questions can be asked about them, however, even the 
fact that they are prime seems to be a challenging result. Section 4 is the technical 
heart of the paper: it is devoted to the proof that these mixed determinantal ideals 
are, in fact, prime. A string of arguments that culminates in Theorem 14.201 proves 
this result when char(K) = 0. In arbitrary characteristic we also show that they are 
prime in special cases including the case when m < 3. On the way to proving these 
results we show that the minors that generate these mixed determinantal ideals form 
a squarefree Grobner basis when the characteristic is arbitrary. 

Section 5 is a look into the future with a view towards applications in algebraic 
statistics. We introduce the notion of adjacent minors of a generic m\ x x • ■ ■ x 
matrix. These come from the study of discrete random variables A 1; . . . , where 
each Xi takes values in {1, . . . , m,}. A particular family of statistical models that de- 
scribe the joint probability distributions of these random variables (the so-called no d- 
way interaction models [S]) gives rise to a toric variety whose set of defining equations 
may be extremely large and complicated |TJE3. However, the positive probability dis- 
tributions are described precisely by the simple multidimensional adjacent minors we 
will introduce. The story of the minimal primes of these ideals is far from complete, 
but in Theorem 15.31 we will describe them in the case mi = m2 = ■ ■ ■ = rrid-i = 2. 

2 2x2 Adjacent Minors 

From the general characterization of minimal primes of lattice basis ideals ^T] it 
follows that every minimal prime P of I mn (2) is of the form 

P = ( Xij : Xij e S) + J : ( J] Xij )°° (1) 

where S is a subset of the variables in the ring If [icy] and J is the ideal generated 
by the 2x2 adjacent minors in the ring K[ ^ S]. In other words, P is 
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boundary edges and boundaries 



uniquely determined by the variables it contains. We will denote this set of variables 
by Sp, and the variables not in Sp by N P . In the rest of this section we will give a 
characterization of the sets Sp and Np that give rise to the minimal primes of I mn (2). 
In order to describe these minimal primes we need a few definitions. 

Let S be a subset of variables of K[xij}. We say that two variables Xij and 
adjacent if s — i + ei and t = j + t2 where ei,e 2 G { — 1,0,1}. The set S is connected if 
for every pair of variables {x^, x st } C S there is a sequence of variables in S starting 
with x^ and ending with x st , and such that each variable in the sequence is adjacent 
to the variable preceding and following it. A subset T of S is called maximally 
connected if there is no larger connected subset of S containing T. A set of variables 
S is a rectangle X[i,j; s, t] if it is equal to the set of all the variables in the submatrix 

^ "^ij ' ' ' "^it ^ 
\ ~^ s 3 ' ' ' "^ s t / 

The boundary edges of X[i,j; s,t] are the four rectangles X[i — — l,t], X[s + 
1, j; s + 1, t], X[i, j — 1; s, j — 1], and X[i, t + 1; s, t + 1]. The boundary of X[i,j; s, t] is 
the union of the the four boundary edges together with the "corner" variables Xi-ij-i, 
x s+ ij-i, Xi-ij+i, and x s +i t t+i- When we speak of boundary edges and the boundary 
of a rectangle we always mean only those parts that are defined, since some boundary 
edges or corner variables might not exist because they are outside of the matrix X mn . 

Example 2.1 Let m = 6 and n = 7. In the matrix X 67 , the two rectangles 
X[l, 1; 3, 1] and X[3, 5; 5, 6] together with their boundary edges and boundaries can 
be viewed in Figure 1. The first rectangle has only two boundary edges since the 
other two are not defined. □ 

Definition 2.2 We will call a partition (S, N) of the variables in X mn a prime par- 
tition if S and satisfy the following properties: 

1. N contains the variables in, Xi n , x mi and x mn , 

2. when N is written as the disjoint union of its maximally connected subsets 
N = \J k 7fc, then each T k is a rectangle, 
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3. each boundary edge of a maximal rectangle T^ in N has a nontrivial intersection 
with the boundary of another maximal rectangle Tg, 

4. the boundary edges of two maximal rectangles of width (height) one in the same 
column (row) do not intersect, and 

5. S is the union of the boundaries of the maximal rectangles Tj,. 

Theorem 2.3 The prime ideal P is a minimal prime of I mn {2) if and only if(Sp, Np) 
is a prime partition. 

The rest of the section is devoted to the proof of Theorem 12 .31 We remark that this 
theorem does indeed cover the characterizations of minimal primes of I mn (2) in the 
known cases, in particular, that of l2 n (2) in [7] and of I%n(2) in [TT]. Before starting 
the proof we give an example to illustrate the definition above and the content of the 
theorem. 

Example 2.4 Figure El displays all the minimal primes of 155(2). This is the smallest 
example where all five conditions in the Definition 12.21 are needed. In this case there 
are 92 minimal primes that can be grouped into 19 equivalence classes modulo sym- 
metries. We show one member from each equivalence class. The boxes in Figure [2] are 
the maximal rectangles in the Np of the corresponding prime partition, and the solid 
buttons correspond to the variables in Sp. The first number following each diagram 
is the size of the equivalence class and the second is the degree of the corresponding 
prime ideal. □ 

Now we begin the proof of Theorem 12.31 with a sequence of lemmas. The first one 
concerns the first property in Definition 12.21 and is taken from Lemma 3.3 in [TT] . 

Lemma 2.5 The corner variables i n , x\ n , x m \, and x mn do not belong to Sp for 
any minimal prime P of I mn (2). 

Lemma 2.6 If P is a minimal prime of I mn (2), then every maximally connected 
subset of Np is a rectangle. 

Proof. Let T be a maximally connected subset of Np and suppose that the adjacent 
variables and Xi + ij + \ are in T. Since these two variables are not in P, the only 
way the adjacent minor x^Xj+ij+i — Xij+iXi+ij could be in P is if the variables Xj+ij 
and x%,j+i also belong to Np. Since T is maximally connected these two variables are 
also in T. Similarly, if Xj+ij and Xjj+i belong to T then Xij and Xi+ij+i are also in 
T. This implies that any maximally connected subset of Np is a rectangle. □ 

The general description of the minimal primes in (JJJ) together with Lemma 12.61 
imply that if P is a minimal prime of J mn (2), and Np, the set of variables not in P, is 
written as the disjoint union of its maximally connected rectangles, say Np = |J fc Tfc, 
then 

P = (xij : Xy e S P ) + (xijX s t - x it x S j : all of scy, x st , x it , x sj are in the same T k ). 
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Figure 2: The minimal primes of 155(2) 
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Lemma 2.7 Let P be a minimal prime of J mn (2) and let T be a maximally connected 
rectangle of Np. Then the boundary of T is a subset of Sp. Moreover, for each 
boundary edge E of T there is another maximal rectangle T' C Np whose boundary 
has a nonempty intersection with E. 

Proof. The boundary of T is a subset of Sp since T is maximally connected. To 
prove the second statement, suppose that there were a maximal rectangle T with a 
boundary edge E that does not intersect the boundary of any other maximal rectangle. 
Consider the prime ideal P' where Spi = Sp\E, and Npi = Np U E. The assumption 
on the edge E implies that T' = T U E is a maximally connected rectangle of Np/. 
The new prime ideal P' still contains all the adjacent minors. The only new 2x2 
minors that appear in the ideal P' involve variables from E, and these are already 
contained in P. This implies that P' is a prime ideal contained in P, contradicting 
the minimality of P. □ 

Lemma 2.8 Let P be a minimal prime of I mn (2) and let T = X[i,j;i,s] be a maxi- 
mally connected rectangle in Np of height one. Then there is no maximally connected 
rectangle of height one in Np of the form T' = X[i, s + 2; i, t]. A similar statement 
holds for vertical rectangles of width one. 

Proof. By Lemma 12*771 the rectangles X[i — — l,t] and X[i + + l,t], and 
the variable Xi jS +i are in Sp. Since the variables of T and T' do not appear in any 
generator of P, the prime ideal P' given by the set of variable Sp \ Xi )S+ \ is a strictly 
smaller prime ideal which contains I mn (2), contradicting the minimality of P. □ 

Lemma 2.9 If P is a minimal prime of I mn (2), then every variable in Sp belongs to 
the boundary of some maximal rectangle in Np. 

Proof. Suppose that Sp contains a variable Xij that is not in the boundary of any 
maximal rectangle in Np. This implies that x^ is adjacent only to variables in Sp. 
Let U C Sp be the set of variables that are adjacent to x^. The 2x2 adjacent 
minors contained in the ideal generated by the variables in U U the same as 

those contained in the ideal generated by the variables in U alone. Hence by omitting 
the variable x^ from P we can construct a prime ideal that contains J mn (2), but 
strictly contained in P. This is a contradiction to the minimality of P. □ 

With the help of the five lemmas we have presented we are ready to prove the 
main theorem of this section. 

Proof of Theorem \2. OA If P is a minimal prime of J mn (2), the partition (Sp,Np) 
satisfies all the five properties to be a prime partition because of the five lemmas, 
Lemma 12.51 through Lemma 12.91 above. Hence we just need to prove the converse. 
Suppose (S, N) is a prime partition, and we assume N = UfcT fc is the partition of N 
into its maximally connected rectangles. We will show that the prime ideal 

P = (^Xij '. €E S/ ~t~ (^XijXgf XtfXsj '. all of x%j^ Xgt^ Xn^ x$j are in the same T^j 
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is a minimal prime of J mn (2). Since all the Tk are rectangles, it is easy to see that 
P contains I mn (2). Suppose that there were a minimal prime P' over I mn (2) strictly 
contained in P. This means that (Spi,Npr) is a prime partition, and Sp> is a proper 
subset of Sp = S. We consider a variable x^ in Sp \ Sp>. By Lemma I2~UI x^ lies on 
the boundary of some maximal rectangle T of Np = N. The variable x^ either lies 
on a boundary edge E of T, or is a corner variable on the boundary of T. In the first 
case, since (Sp/, Np>) is a prime partition, E C Sp \ Sp>, and therefore E is a subset 
of Npr. Moreoever E intersects the boundary of at least one other rectangle T' of 
Np. This means T U E U X" is a connected subset of iVp/, and this union must be 
contained in a maximally connected rectangle T" of iVp/. If is a corner variable of 
the boundary of T, then the two boundary edges E and E' of T that are adjacent to 
must be a part of iVp/. Now by repeating the above argument we are guaranteed 
to have another rectangle T' of Np where T U E U T' is contained in a maximally 
connected rectangle T" of Np> . By the fourth property of Definition 12.21 T and T' 
could not be both height (width) one rectangles in the same row (column) of X mn . 
Hence there are variables x st e T and x pq G T' where s 7^ p and t ^ q. Since these 
variables are in the same maximally connected rectangle T" of Npt, the 2x2 minor 
%stXpq — x sq Xpt is in P'. On the other hand, the set of variables appearing in this 
minor is not contained in any maximally connect rectangle of Np and so it does not 
belong to P. This contradicts the assumption that P 1 C P. □ 

There are many open questions left to answer about I mn (2). A combinatorial 
description of the embedded primes remains elusive. Moreover, there are many in- 
teresting open questions regarding the minimal primes. For example, how many are 
there, which minimal primes have the largest dimension, and what is the degree of 
the radical rad(/ mn (2))? 



3 Maximal Adjacent Minors 

In this section we will describe the complete primary decomposition of the ideals 
Imn{m) for m < n over a field K of characteristic zero, and for m < 3 in arbitrary 
characteristic. With no restrictions on the characteristic of the field our description 
presents I mn {m) as the irredundant intersection of radical ideals. 

Proposition 3.1 The ideal I mn (m) is a radical ideal that is a complete intersection. 
Its codimension is n — m + 1 and it has degree m n ~ m+1 . 

Proof. With respect to the lexicographic term order where xn y X\i y ■ ■ ■ y x\ n y 
X21 y ■ ■ ■ y x mn , the set of m x m adjacent minors of X mn is a Grobner basis of 
I mn (m). This follows from the fact that the initial terms of these minors are pairwise 
relatively prime. The initial ideal is a radical ideal that is a complete intersection, 
and hence so is I mn (m). Since there are n — m + 1 maximal adjacent minors, the 
codimension of I mn (m) is n — m + 1 and its degree is m n ~ m+1 . □ 
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Below we will give a description of the minimal primes of I mn (m). In this sec- 
tion we will show that I mn (m) is the irredundant intersection of these radical ideals. 
The proof that they are prime in characteristic zero and when m < 3 for arbitrary 
characteristic occupies Section 4. 

Description of the minimal primes 

In order to make the narrative cleaner we will assume that the matrix X mn has two 
phantom columns: a column indexed by and another by n + 1. (The role of the 
phantom columns is only to make the description of the minimal primes simpler.) 
We will denote by with 0<i<j<n + l the interval of column indices 
{i, % + 1, . . . , j — 1, j} of X mn , and X[i, j] will denote the submatrix consisting of the 
corresponding columns of X mn . 

Definition 3.2 Let T = {[a±, b±\, [02, 62], • • • , [a*;, H]} be a sequence of k intervals. 
The sequence T is called a prime sequence if it satisfies the following properties: 



2. aj < a i+ i, bi < b i+ i for all i, 

3. bi — di > m for all i, and 

4. < bi — a i+ i < m — 1 for all i. 

The definition says that each interval of T is a block of more than m columns and 
all together they cover all the columns of X mn (including the two phantom columns). 
Moreover the consecutive intervals in the sequence have a nonempty overlap of width 
less than m. Given a prime sequence Y we let Pr be the ideal in K[xij] defined by 

1. all m x m minors of X[aj, bi] for each [a^, bi] G T, and 

2. all (maximal) (6j — a i+1 + l) x (bi — aj + i + l) minors of X[a,i + i, bi] for 1 < % < k — 1. 

In other words, Pr is generated by the mxm minors of the submatrices whose columns 
are indexed by the intervals in T, and the maximal minors of the submatrices whose 
columns are indexed by the overlap of consecutive intervals. An example will do the 
best job to illustrate this construction. 

Example 3.3 We display the minimal primes Pr of /36(3). There are seven primes 
corresponding to the seven prime sequences: 



1 



\J[oi,bi] = [0,n + l], 



r\ = 

r 2 = 

r 3 = 

r 4 = 

r 5 = 

r 6 = 

r 7 = 



{[0,7]} 



{[0,3], [3, 7]} 
{[0,3], [2, 7]} 
{[0,4], [4, 7]} 
{[0,4], [3, 7]} 
{[0,5], [4, 7]} 



{[0,3], [2, 6], [4, 7]} 
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Figure 3: The minimal primes of I^q (3) 

Figure El illustrates these minimal primes. The rectangles with the solid borders 
describe the intervals in the corresponding prime sequence. All 3x3 minors of 
each rectangle are included in the corresponding minimal prime. We also indicate 
the overlaps by rectangles with dashed borders; all the maximal minors in these 
submatrices also need to be included in the corresponding minimal prime. □ 



The Main Theorem 

We now present the proof that the ideals Pp describe the prime decomposition of 
I mn (m) in characteristic zero and when m < 3. The following lemma will be needed 
for the proof of Theorem 13.51 

Lemma 3.4 The variety V(I mn (m)) is contained in [J V(Pr) where the union is taken 
over all prime sequences of [0, n + 1] . 

Proof. We will show that for each matrix X G V(I mn (m)) there is a prime sequence T 
such that X G V(Pr). We describe an algorithm that constructs this prime sequence 
T. For this, let X(X) := {[ci, dj, . . . , [c t , d t )} be the set of all intervals of width less 
than m in [1, n] such that X[cj, dj\ has rank di — Ci, and [q, di\ (jL [cj, dj] for i ^ j. We 
assume that c\ < ■ ■ ■ < c t . We define a prime sequence T as follows: 

1. Set i = ai = b = and T = 0. 

2. While bi + n + 1 do 

(a) i := i + 1. 

(b) Let [cj^dj] G X(X) be the first interval in [ai,n + 1] with Cj t > + 1. If 
there is no such interval set &j = n + 1. 

(c) If dj. < ai + m set 6, = + m, unless a« + m > n in which case set 
bi — n+ 1. Otherwise set 6« = d^. 
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(d) r:=ru{[a,A]}. 

(e) If hi 7^ n+1, let \pj v q^] G X(X) be the last interval in [a*, Set a i+ i = pj v 

3. If the last interval in T has width less than m+ 1 replace it with [n+1 — to, n+ 1] . 

Step 2(c) together with step 3 guarantees that the intervals in T have width at least 
m + 1. Moreover, step 2(e) implies that consecutive intervals have a nonempty overlap 
of width less than m. These show that T is a prime sequence. 

Next we show that X is in V(Pr)- By the above construction of T the overlap 
[aj + i, bi] of two consecutive intervals contains one of the elements [q, di] of X(X). Since 
X[q, rfj] is rank-deficient (it has rank — q instead of — q + 1), so is X[aj + i, 
and the corresponding (6, — Oj+i + 1) x (bi — a i+ i + 1) minors vanish on X. 

We need to show that the rank of X[aj, bi] for each [a,, 6j] G T is at most m — 1. 
For this we analyze a few different cases. First suppose that the width of [a^fej] is 
bigger than m + 1. The above algorithm implies that there are either zero, one, two, 
or three intervals from X(X) that are in [aj,&j]. When there are no such intervals 
then T = {[0, n + 1]}, and the matrix X does not have any rank-deficient submatrices 
consisting of less than m adjacent columns. So X[l,m — 1] has full rank and these 
columns generate a subspace V with dim(F) = m — 1. But since the span of X[l, m] 
is also V and X[2, m] has rank m — 1, the span of X[2, m + 1] and hence the span of 
X[l, m + 1] is V. Now by induction it is easy to see that the span of X is the (m — 1)- 
dimensional space V, and therefore all m x m minors vanish on X. If there is one 
interval from X(X) inside [a*, bi], then either [aj, bi] = [a*, n + 1] and the only minimal 
rank-deficient interval is of the form [oj, c] with c < n, or [oj, bi] = [0, bi] and the only 
minimal rank-deficient interval is of the form [c, bi] with c > 1. In the first case, the 
submatrix X[aj + l,n] has at least m columns, and this matrix does not have any 
rank-deficient submatrices consisting of less than m adjacent columns. By the same 
argument above we conclude that the span of X[a« + l,n] is an (m — l)-dimensional 
subspace V . But since X[a i; c] is minimally rank-deficient we conclude that the span 
of X[oj, n] is V , and therefore all m x m minors corresponding to this interval vanish 
on X. A symmetric argument applies when [aj,6j] = [0,6j]. 

In the case where there are two intervals from X(X), the two minimally rank- 
deficient intervals are of the form [a*, c] and [d, bi] where c < bi and d > ai or of the 
form [cLi,c] and [a, + l,d] which forces the interval [aj,6j] = [aj,n+ 1]. This means 
that, in the first case, X[aj + — 1] has at least m columns and does not have 
any rank-deficient submatrices consisting of less than m adjacent columns. Similar 
considerations as above show that Xfa, + — 1] has rank m — 1. Since X[aj,c] 
is minimally rank deficient and X[aj + 1, Oj + m] is not rank deficient we see that 
the column of X indexed by a« is in the span of the columns of X[dj + l,Oj + m] 
and so X[aj,6j] has rank m — 1. In the second case, the usual argument implies 
that X[aj + l,n] has rank m — 1. But since X[aj,c] is minimally rank deficient and 
X[cij + 1, c] is not rank deficient we see that the column of X indexed by Oj is in the 
span of the columns of X[aj + 1, n] and so X[aj, n] has rank m — 1. Finally, we consider 
the case where there are three intervals from X(X) in [oj,6j]. By construction these 
are necessarily of the form [a*, c], [oj + 1, d], and [e, 6,]. But then the combination of 
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the two arguments for the cases with two minimally rank deficient intervals shows 
that X[eij, b J has rank m — 1. 

The case where the width of [a*, &;] is exactly m + 1 requires a slightly different 
argument. If [a*, &«] = [0, m] or [n + 1 — m, n + 1] there is nothing to show since there 
is only one mxm minor that needs to be considered and it is necessarily an adjacent 
minor. If we are not in these two trivial cases, the construction of T implies that 
there are at least two intervals from X(X) contained in [a^fej]. Let [c,d] be the first 
such interval and [e, /] the last such interval. Observe that we have c = a«. Now if 
these two intervals do not overlap then any mxm submatrix of X[cij, bi] will contain 
one of these rank-deficient intervals and hence its rank will be at most m — 1. If there 
is an overlap we have ca < e < d < f < b^. The rank of the submatrix X[ctj,(i] is 
d— cii, and the rank of X[e, /] is / — e. Moreover, since these intervals are minimally 
rank-deficient the rank of X[e, d] is d— e+ 1. But then the rank of X[aj, bj\ is at most 



Theorem 3.5 Let K be a field of arbitrary characteristic. Then the ideal of adjacent 
minors I mn (m) can be written as the irredundant intersection of radical ideals 



where the intersection runs over all prime sequences of [0, When char(K) = 

or when m < 3 in arbitrary characteristic this is a minimal prime decomposition. 

Proof. Since Pr is radical by Corollary 14.51 the intersection f] Pr is also radical. 
Moreover, given any prime sequence T, each adjacent m x m minor belongs to Pr 
since the column indices of this minor are either contained in an interval [aj, bi\ in T 
or they contain the indices of one of the overlaps [a i+1 ,bi\. This shows that I mn (m) 
is contained in this radical ideal. If K is algebraically closed, Lemma 13.41 and the 
Nullstellensatz imply that I mn (m) is equal to the intersection. Since all the ideals 
in question lie in for any field K we deduce that the equation holds over any 

field by passing to the algebraic closure. In order to prove that this intersection is 
irredundant we need to argue that if V ^ V then Pr and Pr' are incomparable. This 
is a consequence of our Grobner basis arguments and is proven in Corollary 14.61 The 
intersection is a prime decomposition in characteristic zero because Pr is prime when 
char(i^) = 0: this is the content of Theorem 14.201 Similarly, all the ideals Pr are 
prime when m < 3 and the characteristic is arbitrary. This is proven in Corollary 14. 151 



Theorem 3.6 Let f m {n) be the number of primes in the prime decomposition of 
I mn {m). Then f m {n) is generated by the following recurrence: 



(d-a i ) + {f-e)-(d-e + l) + (b i -f)=b i -a i -l = m-l. 



This completes the proof of the lemma. 



□ 



Imn{ni) = P|P r 



□ 




i=0 
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subject to the initial conditions / m (l) = / m (2) = • • • = f m (m — 2) = 0, f m (m — 1) = 1 
and f m {m) = 1. 

Proof. We count the prime sequences T on [0,n + 1]. There are no such sequences 
when n < m — 1 and there is a unique sequence when n = m- lorn = m. If the last 
interval [a*, n + 1] in T has width greater than m + 1 then T' = T — [a*, n+ 1] U [a*, n] is 
a prime sequence of [0, n]. If the width of [a$, n + 1] is to + 1, then r' = T — [a,, n+ 1] 
is a prime sequence of [0, n + 1 — j] for 2 < j < m. This gives an injective map from 
the set of prime sequences of [0,n + 1] to the disjoint union of prime sequences of 
[0, n + 1 — to], [0, n + 2 — to], . . . , [0, n). It is also easy to see that the inverse of this 
map is injective. Hence these two sets have the same cardinality which proves the 
theorem. □ 



4 A new class of prime determinantal ideals 

We now prove that the ideals Pp axe prime ideals in characteristic zero. We believe 
they are prime in arbitrary characteristic and we verify this conjecture in special 
cases. First we will show that Pp is a radical ideal through a Grobner basis argument 
which does not depend on char (if). Then we use an intricate geometric argument to 
show that V(Pr) is irreducible over fields of characteristic zero. 

A Grobner basis 

We will use the diagonal term order introduced in Proposition 13.11 The argument 
will also depend on the following lemma proved in 0j. 

Lemma 4.1 Let I and J be two homogeneous ideals of a polynomial ring K[x±, . . . ,x n ], 
and let F and G be Grobner bases of I and J with respect to a fixed term order -<. 
Then F U G is a Grobner basis of I + J with respect to -< if and only if for every 
f G F and g G G there exists h G I H J such that m(h) = LCM(in(/), m(g)). 

Our main Grobner basis result follows from the result below. 

Lemma 4.2 Let F be the set of m x to minors of X mn and let G be the set of 
the k x k minors of the submatrix which consists of either the first or the last k 
columns of X mn where k < to. Then with respect to the lexicographic term order 
x\\ y a; 12 >-•••>- x\ n >-•••>- x mn the set F U G is a Grobner basis of the ideal it 
generates. 

Proof. We prove the case where G is the set of the k x k minors of the submatrix Y 
consisting of the first k columns of X mn since the other case follows from a symmetric 
argument similar to the one we give below. We will use Lemma 14.11 where I = (F) 
and J = (G). Note that F and G are Grobner bases for I and J with respect 
to the given term order by For / G F and g G G we want to show that 

there is h G / D J such that in(/t) = LCM(in(/), in(g)). We will construct h as 
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follows: let in(/) = Xu 1 X2i 2 ■ ■ ■ x mim where 1 < i\ < i 2 < ■ ■ ■ < i m < n, and let 
in(g) = x 3ll Xj 2 2 • • -Xj k k where 1 < j\ < 32 < 4 ■ ■ < jk < m - It is n °t hard to see that 
if in(/) contains a variable x S j s where i s < k then for the corresponding variable Xj is i B 
of in(g) we have ji s > s. Let Y\ be the set of columns of Y indexed by the jt with 
ji s = s, and let Y 2 be the set of those columns of Y indexed by those jt which have 
ji s > s. Moreover, let Y 3 be the set of columns that do not contain a variable from 
in(/); that is, Kj consists of the columns of Y which are not in Y\_ or Y 2 . Finally, Y 4 
will be the set of columns of X mn with indices {i t : it > k}. We make two simple 
observations. First of all, the sum |Yi| + + \Yi\ is equal to m, and secondly, Y\ 
comes before all of the other Yi in X mn : indeed, Y± is the first |Y"i| columns of X mn . 
Now let us look at the rows of Y in which a variable of in(g) that is also either in Y 2 
or y 3 appears. These rows form a (II2I + I "^3 1) x k submatrix of Y that we will denote 
by A. With all this data we construct the (m+ \ Y% \ + jY-j)) x (m + II2I + 1^3 1) matrix 



" A 





Y 


Y 2 | Y 4 



and we let h be its determinant. Since h can be computed by the Laplace expansion 
either using the m x m minors of the last m rows, or using the k x k minors of the 
first k columns we deduce that h is in / D J. The specific term order we use together 
with the second observation above gives us the fact that m(h) = LCM(in(/), in(p)). 
This is the easiest to see by computing the Laplace expansion using the first \Y 2 \ + |Kj| 
rows of the matrix. □ 

Example 4.3 The proof of Lemma l4~21 relies on the construction of a special element 
h in / H J. We will now describe an example of this construction in the case m — 5, 
n — 6, and k = 3 and we will suppose that we are taking 3x3 minors from the 
last three columns of X mn . In other words, we illustrate the symmetrical case that 
we omitted in the above proof. We will consider the special case where / is the 
5x5 minor with column indices {1,2,3,4,6} and g is the 3x3 minor with row 
indices {2,3,5}. We can represent the situation pictorially with a marked matrix: 
the crosses x represent variables which appear in the leading term of / and the 
squares □ represent variables which appear in the leading term of g. Our marked 
matrix is 



X 




X 


□ 


X 


□ 




X 







According to the symmetric version of the construction, we take Y\ to consist of the 
last column of the matrix, Y 2 is the third to last column, Y3 is the second to last 
column, and Y4 consists of the first three columns. We construct the new matrix 
whose determinant is the desired polynomial h. In this new matrix, we again use 
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symbols to mark the desired variables in the leading term. This new matrix is a 7 x 7 
matrix and looks like 



' 











□ 


















□ 


X 














X 




□ 


□ 








X 






□ 








X 


X 





It is easy to see that \n(h) = LCM(in(/), in(g)): just use the Laplace expansion along 
the first two rows. □ 

Theorem 4.4 With respect to the lexicographic term order x\\ >- X\2 >-•••>- x>\ n >- 
e minors defining P? form a Grobner basis. 

Proof. We do induction on the number of intervals in T = {[ax, bi], . . . , [at, b t ]}. If 
r = {[0, n + 1]}, then Pp is just generated by the m x m minors of X mn and by the 
results in ^1] they form a Grobner basis. When there is more than one interval then 
I~" = T — [a t ,n + 1] is a prime sequence for [0, b t -\ + 1]. By induction, the set of 
minors F generating Ppi is a Grobner basis of / := I m b t -i{ m )- Now we let J be the 
ideal generated by the m x m minors corresponding to the interval [a t , b t ] and the 
maximal minors of the overlap [a t ,b t -i]. We let k := b t -\ — a t + 1, and we denote 
the set of these k x k minors together with the m x m minors that generate J by 
G. Lemma (4.21 implies that G is a Grobner basis of J. Now we will use Lemma [4. II 
to prove the theorem. Observe that if / G F and g 6 G are minors of submatrices 
corresponding to intervals or overlaps of intervals which do not share a column, then 
LCM(in(/), in(g)) = in(/) ■ in(g) and we choose h = f ■ g. Hence we only need to 
study the pairs of intervals that do overlap. Here is the list of the cases we need to 
consider: 

(a) both / and g are m x m minors, 

(b) / is an s x s minor coming from an overlap that also intersects the interval 
[at, n + 1], and g is an m x m minor, 

(c) / is as in (b), and g is a k x k minor, 

(d) / is an m x m minor coming from an interval that is not [a t -i,b t -i] and g is 
k x k minor, and 

(e) / is an m x m minor coming from [dt-i, h-i] and g is a k x k minor. 

The last case is covered by the proof of Lemma 14.21 In all the other cases, simple 
arguments show that the leading terms of / and g are relatively prime and hence 
we choose h = f ■ g. For completeness, we go through this argument for case (c). 
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The main tool is the following simple observation. For any maximal minor of any 
matrix, the leading term selected by our diagonal lexicographic term order has all 
of its variables lying in the parallelogram-shaped region bounded by the diagonal 
extending from the upper left hand corner of the matrix and the diagonal extending 
from the lower right hand corner. Since T is a prime sequence, the smallest interval 
[a, b] which contains the column indices of both / and g has width greater than or 
equal to m + 1. This ensures that the two regions corresponding to the possible 
variables in the leading terms of these minors do not intersect, because the diagonal 
from the upper left corner of X[a, b] is below the diagonal from the lower right corner 
of X[a, b}. This guarantees that the leading terms of / and g are relatively prime as 
desired. □ 

Corollary 4.5 The ideal Pr is radical. 

Proof. The initial ideal of Pr given by Theorem 14.41 is squarefree, and therefore it is 
radical. Then Pr is also radical. □ 

Corollary 4.6 IfT^V then Pr and Pp are incomparable. 

Proof. We will show that Pr is not contained in Pr'. For this it suffices to show 
that there is a minor among the generators of Pr which is not contained in Pr'. Let 
[aj,&j] be the first interval of V which is not contained in V and let [cj, c?J be the 
corresponding ith interval of V . The intervals [a\, b\], . . . [aj_i, are the first i — 1 
intervals which are common to both T and V. There are a few cases to consider. 

If i = 1 then [ai,fei] = [0, bi] and [ci,di] = [0, di\. Suppose that b\ > d\. Among 
the indices in the interval [d\ + 1, b\] there exists at least one index e so that [e, e] 
is not an interval obtained by overlapping two consecutive intervals in T'. Then the 
m x m minor with columns indices {1, . . . , m — 1, e} is contained in Pr but not in Pr' 
because its leading term is not divisible by any leading term in the Grobner basis for 
Pr'. If we suppose that b± < d%, then any (pi — 02 + 1) x (b\ — 02 + 1) minor with 
column indices [02, 61] belongs to Pr but not Pr' since its leading term is not divisible 
by any leading term in the Grobner basis for Ppi. 

Now we suppose that i > 1. The arguments are similar to those in the preceding 
paragraph and we sketch them briefly. Suppose a« < q. Then there is an m x m 
minor with column indices in [eij, bi] using the column index a, which is contained in 
Pr but not P r . If a« > q then there is an — Oj + 1) x — a« + 1) minor 
with column indices equal to [aj,6j_i] which is contained in Pr but not Pr'. Finally, 
if a; = q then a minor modification of the i — 1 case shows that Pr contains a minor 
which is not contained in Pr'. □ 

V(-Pr) is irreducible 

Before proceeding with the proof, we will outline the strategy that we will employ 
to show that V(Pr) is irreducible over a field K with char(i^) = 0. First, we will 
construct a morphism from an irreducible variety X to V(Pr). Then we will argue 



15 



that this morphism surjects onto a Zariski open subset W of V(Pr) when restricted 
to a Zariski open (and necessarily irreducible) subset y of X. This implies that W 
is irreducible. Up to this point the results will be obtained without any assumptions 
on the characteristic of the field. Then we will assume that K — C, and we will 
show that the closure of W is equal to V(Pr) which proves that V(Pr) is irreducible. 
This will require a perturbation argument which we present in the next subsection. 
Finally, we use standard arguments in the proof of Theorem I4.2UI to show that Pr is 
prime over any field of characteristic zero. 

We first define the irreducible variety X . In order to do this we need to introduce 
a poset Qr associated to a prime sequence T. 

Definition 4.7 Let T be a prime sequence. The elements of the poset Qr are certain 
subintervals of the intervals in T which will be defined recursively, and these subinter- 
vals are ordered with respect to inclusion. The intervals in V are the maximal elements 
of Qr, and we sort them with respect to each interval's starting index, the left border, 
in ascending order. These will form the elements in row 1. The elements in row 2 
are the nonempty subintervals obtained by intersecting two consecutive intervals in 
row 1. We also sort row 2 in ascending order with respect to the left borders. The 
subsequent rows are defined recursively: the elements in row r consist of all nonempty 
intervals that arise from the intersection of two consecutive elements from row r — 1. 
It is clear that every nonmaximal element is covered by exactly two elements (a left 
and a right parent), and each nonminimal element covers at most two other elements 
(a left and a right child). 

Example 4.8 Let m = 6 and consider the sequence of intervals 

T = {[0,7], [3, 9], [5, 11], [7, 13], [10, 17]}. 

The second row of the poset consists of the overlapping intervals [3, 7], [5, 9], [7, 11], 
and [10,13]. The third row is formed by the intervals [5,7], [7,9], and [10,11]. The 
fourth and final row of the poset is the interval [7, 7] . This poset is illustrated in 
Figure H □ 

In order to define X we need one more piece of information. This will be a positive 
integer attached to each element of Qr- 

Definition 4.9 For each p E Qr let 

{m — 1 if p is in the first row of Qr 
w(p) — 1 if p is in the second row of Qr 
w(p) otherwise 

where w(p) is the width of the interval p. 

Now each element p E Qr will give rise to a general linear group GL k ^ of invertible 
k(p) x k(p) matrices where k(p) = D(q) — D(q'), and q is the left parent of p and q' 
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[0,7] [3,9] [5,11] [7,13] [10,17] 

/\/\/ 

[3,7] [5,9] [7,11] [10,13] 




[5,7] [7,9] [10,11] 




[7,7] 

Figure 4: The poset Q T for r = {[0, 7], [3, 9], [5, 11], [7, 13], [10, 17]} 

is the left child of p. If p does not have a left parent then we set D(q) = m, and if 
p does not have a left child, then we set D(q') = 0. Moreover each maximal element 
q E Qr will give rise to an affine space A^ q \ and we define £(q) as follows: suppose 
q corresponds to the interval [a s ,6 s ] and let [a s+ i,6 s+ i] be the next interval (if there 
is one). Let A := [a s ,a s+ i — 1] or A := [a s ,n] if [a s ,6 s ] is the last interval. Now for 
each index i 6 A there is a unique p(i) G Qr which is minimal among all elements 
containing i. It is an easy exercise to see that p(i) G {po, ■ ■ ■ ,p r } where p = q and 
Pj + i is the left child of pj. With this we define £(q) = J2 ieA D(p(i)). Finally we arrive 
at the variety 

X = J] GL k[p) x JJ A««>. 

p£<2r 1 maximal 

We note that over an infinite field X is irreducible since it is the product of irreducible 
varieties. 

Next we define a map from X to A mn , the space of all m x n matrices. Given 
a point in x G X we will build an m x n matrix piece by piece using the intervals in 
T. We start with the last interval [a t ,n + 1] and the corresponding maximal element 
q G Qr- Then we build an m x |A| matrix Z as follows: for each % G A we set all 
entries in column % with row indices D(p(i)) + 1, D(p(i)) + 2, ... , m to zero. There 
are precisely £(q) entries in Z that are not set to zero yet, and we "plug in" the 
coordinates of the point x corresponding to to these entries. We set X := Z. 
Now let q = po,Pi, ■ ■ ■ ,Ps be the elements of Qr such that Pj+i is the left child of pj, 
and let gj G GL k ( Pj ) be the matrices that could be read off from the corresponding 
coordinates of x. For j — 0, . . . , s we define X := gj ■ X recursively, where gj ■ X is 
obtained by multiplying the last k(pj) rows of the first D(qj) rows of X, and qj is the 
left parent of pj (since D(qj) > k(pj), by the definition of k(pj) this makes sense). 

After we have gone through the sequence po, ■ ■ ■ ,p s , let the resulting matrix be 
Y. Next we move onto the second to last interval [a t _i,6 t _i], and using the set A 
associated to this interval we build a matrix Z, and then we set X := [Z|Y]. Now using 
the various invertible matrices associated to the sequence of the left children starting 
from [a t _i,6 t _i] we repeat this procedure. Clearly the result of this construction is 
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an m x n matrix. It is also clear that this map is a polynomial map and hence a 
morphism. 

Example 4.10 This is a detailed example displaying the variety X and the re- 
cursive construction of the map (j). Let m = 4 and let T be the prime sequence 
T = {[0,5], [3,7], [5,10]}. The second row of the poset Qr consists of the two in- 
tervals [3,5] and [5,7], and the third row of the poset is the singleton interval [5,5]. 
According to the construction of X we have 

X = GL 4 x GL 3 x GL 2 x GL 2 x GL 2 x GL 2 x A 6 x A 4 x A 11 . 

We have ordered the general linear groups and the affine spaces in the reverse of 
the order in which they are used in the map 0. This should not be confusing to the 
reader: the ordering of the general linear groups mimics the right to left order of 
group actions and the affine spaces are ordered in this way as a reminder that we 
construct the matrix in the image of <fi from right to left. Now let x be an arbitrary 
point in the variety X. We begin with the interval [5, 10], the last interval in T, and 
use the affine space A 11 to construct a 4 x 5 matrix Z which looks like 

* * * * 
* * 


and corresponds to columns 5 through 9 of our eventual completed matrix. We set 
X := Z. Now we read down the right-most chain in the poset and apply the action 
of general linear groups accordingly. In particular, we apply g\ G GL 2 to the bottom 
two rows of X, then g 2 G GL 2 to the middle two rows of g\ ■ X, and finally g% G GL 2 
to the first two rows of g 2 • gi • X. Pictorially, we have 

0**** g\ 0**** <?2 0**** 33 ***** 
* * — > * * — > * * * * — > * * * * 
oooooj |_ * * J |_ * * J |_ * * _ 

where the last matrix is the matrix Y obtained at the end of this iteration of the 
construction. Now we look at the second to last interval [3, 7] in T. Comparing with 
the interval [5, 10] we see that A = [3, 4], and we add two new columns Z to Y above. 
These come from our A 4 to arrive at a matrix X := [Z|Y]. Reading the second 
descending chain in <2 r we apply g 4 G GL 2 to the last two rows of X and then apply 
<?5 G GL 3 to the first three rows of g 4 ■ X. Pictorially, this looks like 
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And again the last matrix is the matrix Y obtained at the end of the second iteration 
of the construction. We are now at the last step and we adjoin two new columns Z to 
our matrix Y. The entries in these columns come from the A 6 . We form the matrix 

* * * * 

Since the interval [0, 5] has no left child, we deduce that we should apply g$ G GL 4 
to the entire matrix. This final matrix g§ • X is the image of x under <fi. □ 

Proposition 4.11 The image of <fi is contained in V(Pr). 

Proof. We need to show that for every x G X all the minors that generate P r vanish 
on 4>(x). We will prove this by using the definition of 0. First we observe that 
if a set of minors vanish on the partial matrix X in the definition of then after 
the row operations gj ■ X these minors will still vanish on X. We will show that 
as we build X each submatrix of X that corresponds to p G <2r has rank at most 
D{jp). This is certainly true after constructing X corresponding to the last interval 
of T, since at most the first D{pj) rows of each submatrix corresponding to pj are 
nonzero. An inductive argument shows that after applying gj to X, the columns of 
X that are in the submatrix corresponding to pt for k = 0, . . . ,j, but that are not 
in the submatrix corresponding to Pk+i have nonzero elements in at most the first 
D(q k ) rows where q k is the left parent of p k . So when Y is constructed at most the 
first D(qj) rows of the matrix corresponding to pj for j = 0, . . . , s are nonzero. In 
order to finish the proof by induction, we assume that after constructing the matrix 
Y for an interval in [a r , b r ] where r > 1 all the minors arising from the intervals 
[a r , b r ], [a r+ i, b r +i], . . . , [a t , n + 1] and their consecutive overlaps vanish on Y, and 
in the submatrices corresponding to the sequence p ,...,p s (where p is [a r ,6 r ]) at 
most the first D(qj) rows are nonzero, where qj is the left parent of pj. When we 
move to the next interval [a r _i,6 r _i] with the corresponding sequence of elements 
po, . . . ,p u , first we consruct [Z |Y]. It is easy to see that the submatrix Aj of this 
matrix corresponding to pj is obtained by concatenating the portion of Z contained 
in Aj with the submatrix corresponding to the right child of pj . Now at most the first 
Dipj) rows of the portion of Aj contained in Z are nonzero, and by induction the 
same is true for the submatrix corresponding to the right child of pj. Hence at most 
the first Dipj) rows of Aj are nonzero. This shows that the minors arising from the 
intervals [a r _i, 6 r -i], [a r , b r ], [a r+ i, 6 r +i], • • • , [(Hi n + 1] an d their consecutive overlaps 
vanish on X := [Z |Y]. After applying the row operations #j, at most the first D(q~j) 
rows of the matrix corresponding to pj will be nonzero where q~j is the left parent 
of pj because k{pj) = D(q~j) — D{pj +1 ). This implies that X has the properties the 
induction is based on, and this completes the induction. □ 

Now we let W be the subset of V(Pr) consisting of matrices X where the rank 
of each submatrix of X corresponding to p G Qr is equal to D(p). Since this subset 



X := [Z|Y] = 



* 


* 


* 


* 


* 


* 
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is defined by the non-vanishing of certain minors we conclude that it is a Zariski 
open subset of V(Pr). It is guaranteed to be nonempty by the results in the next 
subsection. Moreover, we let y be the set of x G X such that <p(x) G W. We argue 
that y is an open subset of X. For this, consider an x G X where we take the entries 
as indeterminates. Then 4>(x) is a matrix with polynomial entries in the coordinates 
of x. Thus y is defined by the non- vanishing of certain minors of (f>(x). Furthermore, 
y is irreducible since X is irreducible. 

Proposition 4.12 The morphism : y — > W is surjective, and therefore W is 
irreducible when K is an infinite field. 

Proof. Since the second statement follows from the first we just prove the first claim. 
We will do this by constructing x & y for each X G W such that <j)(x) = X. We start 
with the first interval p = [ai, bi] in T. Since X[ai, 61] has rank D(p) — m — 1, we can 
find a g G GL k u,\ where k(p) = m so that g ■ X[ai,6i] is row-reduced, in particular, 
the last row is a zero row. We let X = g • X, and we record g~ x as well as the entries 
of the first D(p) = m — 1 rows of each column of X with column index i 6 A (see the 
definition of A in the paragraph before Example I4.10|) as part of the element x we are 
constructing. Then we delete these columns from X to obtain the new X. 

By induction, suppose we have gone through the intervals [a 1; bi], . . . , [a r _i, & r _i] 
and the matrix X = X[a r ,n] has the following properties. Let Po,Pi, ■ ■ ■ ,p s be the 
sequence of elements where po is [a r _i, 6 r _i] and Pj+\ is the left child of pj. Then the 
only nonzero rows of the submatrix of X corresponding to the right child of pj are 
the first D(j)j) rows of this submatrix. 

Now we let po, ... ,p u be the sequence where po is the interval [a r , b r ] and pj + ± is 
the left child of pj. We observe that, by induction, only the first D(q~j) rows of the 
submatrix of X corresponding to pj could be nonzero where q~j is the left parent of 
pj. Now we apply g~j G GL^^ to X successively starting from j = u and finishing 
with j = 0. In this process gj will be chosen as the matrix which will be applied 
to the last k(pj) rows of the first D(qj) rows of X so that after applying gj, the 
submatrix of X corresponding to pj is row reduced. Since we have assumed that 
the submatrix corresponding to pj + \ has rank equal to D(pj +1 ), this implies that 
the submatrix corresponding to pj with row indices D(j>j + i) + 1, D(pj + i) + 2, . . ., 
D(pj + i)+k(pj) has rank k(pj)—l. This implies that after applying gj the rows indexed 
by D(pj) + l,D(pj) + 2, . . . , m in the submatrix corresponding to pj will consist of 
zeros. Moreover, when applying g~j, the definition of k(pj) and the particular rows 
which will be affected guarantee that the zero rows of the submatrices corresponding 
to pj+i, ■ ■ ■ ,p u stay as zero rows. Hence when we compute X := g ■ X, we return 
to the property we started with at the beginning of the induction step, namely: the 
only nonzero rows of the submatrix of X corresponding to pj are the first D(pj) rows 
of this submatrix. 

Now we delete the submatrix X[a r , a r+1 — 1] from X to obtain the new X for 
the next iteration, and we record the £(po) possibly nonzero elements in the deleted 
columns of X as part of x (this belongs to A^°)) as well as the inverses of all the 
matrices g~j G GL k ( p ^ which we used. Since we have returned X to the form of the 
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inductive hypothesis, this shows that we can continue the procedure to compute an 
x G y whose image under is X. 

□ 

We conclude this section with the proof that in certain special cases, the map 
: X — > V(Pr) is, in fact, surjective (i.e. not just surjective on an open subset). 
Hence, in these cases we may conclude that Pp is prime without resorting to the 
analytic techniques in Proposition 14. 19l 

Proposition 4.13 Suppose that Qp has only two rows. Then the map <fi : X — > V(Pr) 

is surjective. 

Proof. We will closely follow the proof of Proposition 14. 12l but with an extra twist. 
Given an X G V(Pr-) we will construct x E X such that <p(x) = X. We start with the 
first interval p = [ax, b\] in V . Since X[ai, &i] has rank P(p) = m — 1, we can find a 
g G G^fc(p) where k(p) — m so that # • X[oi, &i] is row-reduced, in particular, the last 
row is a zero row. We let X = g ■ X, and we record g~ x as well as the entries of the 
first D(p) = m — 1 rows of each column of X with column index i G A as part of the 
element x we are constructing. Then we delete these columns from X to obtain the 
new X. 

Since Qr has only two rows, our inductive hypothesis is simpler than Proposition 
14.121 Namely, suppose that we have gone through the intervals [ax, bx], ■ ■ ■ , [a r -i> K-i] 
and the submatrix of X = X[a r , n] indexed by A = [a r , 6 r -i] has its last row as a zero 
row. 

We will let po = [a r , b r ] and px = [a r ,b r -x\- Since <2r has only two rows, these 
are the only two elements of Qr which we need to consider when we perform our 
induction. First we use an element of gx G GL m _ l to row reduce the submatrix 
consisting of the first k(px) = m — 1 rows of X and the columns indexed by px- This 
submatrix has rank < D(px) = &r-i — a r'- if the rank of the submatrix is strictly 
less than D(px) we must perform our row reductions with caution to ensure that 
the submatrix of X with columns indexed by [b r -x + 1, b r ] and consisting of the last 
k(po) = m — D(px) rows has rank less than m — D(px). To ensure this possibility, we 
note that there are two cases to consider. In the first case, the submatrix consisting 
of its first m — 1 rows of X [a r , b r ] has rank m — 1 . In this case we can choose gx so 
that the (m — l)st row of gx ■ X[a r ,6 r ] is a multiple of the last row of X[a r , 6 r ]. In 
the second case, the submatrix consisting of the first m — 1 rows of X[a r , b r ] rank 
< m — 1. Then we can choose gx so that the (m — l)st row of gx ■ X[a r , b r ] is the zero 
row. In either case, this ensures that the last m — D(px) rows of X[6 r _i + l,b r ] has 
rank less than m — D(px). Now we apply row reduction via g in GL k ^ Q \ to the last 
m — D(px) rows of X to bring X into the form of the inductive hypothesis. 

To complete the proof, we record the entries in first D(px) rows of X[a r , fe r _i], 
and the entries in the first D(p ) rows of X[6 r _x + 1, a r+1 — 1] (this becomes a set of 
entries in A^°)). We also record the inverses of gx and go> an d we delete the first 
|A| = a r+1 — a r columns from X to arrive at X := X[a r+ i,n]. By our construction, 
this matrix is in proper form of the inductive hypothesis, and so we may continue the 
process to construct x such that <f)(x) = X. □ 
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Corollary 4.14 Let K be a field of arbitrary characteristic and suppose that Qr has 
only two rows. Then Pr is a prime ideal. 

Proof. If K is algebraically closed, Proposition 14.131 and Corollary 14.51 together with 
the Nullstellensatz imply that Pr is a prime ideal. But this implies Pr is prime over 
any field by passing to the algebraic closure. □ 

Corollary 4.15 If m <3, then Pr is prime for any prime sequence T. 

Proof. For m < 2 the statement was proven in [Jj. When m = 3, each interval 
[cij, bj\ G T has width greater than or equal to 4, whereas each of the overlapping 
intervals [ai,6j_i] and [aj + i,&j] has width less than or equal to 2. This implies that 
the intervals [aj,&i_i] and [aj + i,6j] do not overlap and so Qr has only two rows. By 
Corollary I4.14[ Pr is a prime ideal. □ 

The reader may wonder why we have not shown that <p : X — > V(Pr) is surjective 
in general, eliminating the need for the analytic arguments in Proposition 14.191 In 
general, it is not clear if this is true; so we state it as a question. 

Question 4.16 Is the morphism <fi : X — > V(Pr) always surjective? 

We do not even know the answer in the case m = 4 with T = {[0, 5], [3, 7], [5, 10]} 
from Example I4.1UI which is essentially the smallest instance not covered by Propo- 
sition 0^21 An affirmative answer to this question would imply that Pr is prime for 
all T and in arbitrary characteristic. 

The perturbation argument 

We now present the details of the argument that every point of V(Pr) is arbitrarily 
close to W when the underlying field is C. It suffices to show that given a matrix 
X G V(Pr) \ W, there exists an infinitesimal perturbation which will make the rank of 
all the submatrices corresponding to q G Qr equal to D(q). Making this perturbation 
requires care, since an arbitrary perturbation might force the rank of some submatrix 
to jump to a value greater than D(q), and this will result in a matrix that is no longer 
in V(P r ). 

For notational convenience we denote by Qr(X) the poset Qr where the elements 
are taken to be the actual submatrices instead of the intervals. This way, for instance, 
we will be able to work with span(p) of p G <2r(X) which will mean the vector space 
spanned by the columns of p. Similarly dim(p) will denote the dimension of this 
vector space. 

Definition 4.17 Let p be an element of the poset Qr(X). We let A4(p) be the set 
of elements of Qr(X) above p whose rank is equal to the desired maximal rank: 

M(p) = {qe Q r (X)\q > p, dim(g) = D(q)} . 
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Next we define Per(p), the vector space of allowable perturbations to be 



Per(v) = I n « e ^) Span(g) if M{P) * 
m \ C m HM(p) = $ 

Lemma 4.18 Let p be an element of the poset Q r (X). Then 

D(p) < dim Per (p), 

that is, there is a large enough vector space in which perturbations can be made. 

Proof. We suppose throughout that dim(p) < D(p) since in the case of equality there 
is nothing to prove. Assuming this, the case where Ai(p) = is trivial, so suppose 
M.(p) is nonempty. If p is in the first row of Qr(X) there is nothing to show. If p is 
in the second row then dim Per (p) > m — 2 whereas D(p) < m — 2 by the definition 
of T. So suppose that p is in at least the third row of the poset. 

Clearly it is enough to take the minimal elements of M. (p) when computing Per(p). 
Furthermore, since dim(p) < D(p) = w(p) we see that no q > p can belong to Ai(p) if 
q is in at least the third row. Otherwise for such a q to be in Ai (p) would require that 
q has its rank equal to its width. But then dim(p) = w{p) which is a contradiction. 
With this in mind, we first prove the inequality in the statement of the lemma when 
the minimal elements of M(p) consist of elements in the second row of the poset. 

Let q 1 , q 2 , ■ ■ ■ , q r be all the elements in the second row of the poset which are larger 
than p. We assume that each qj spans a subspace of dimension w(qj) — 1 :— m — ij — 1 
where ij > 0. Now consider the intersection of the vector spaces spanned by the 
qi. Since qi and q 2 are sub matrices of q\ V q 2 which has rank less than or equal to 
m — 1, we deduce that the vector space span^) n span(g 2 ) has dimension at least 
m — i± — i 2 — 1. By induction, the vector space 

r 

Per(p) = p| span(g i ) 

3=1 

has dimension > m — 1 — ^ . ij. On the other hand, w(p) < m — r + 1 — . ij since 
the width of the intervals in T is at least m + 1, and this completes the proof in the 
case when the minimal elements of M.{p) consist of elements in the second row of the 
poset. The general case now follows because removing one of the qj from M.(p) (and 
possibly adding something from the first row) can only make dim Per (p) larger. □ 

Now we show how perturbations should be made inside a given rank-deficient 
matrix X so that every sub matrix corresponding to p G <2r(X) has maximal rank 
D{p). 

Proposition 4.19 Let X e V(Pr) be a matrix such that dim(p) < D{p) for an 
element p e <2r(X). Then there is an infinitesimal perturbation o/X to X' e V(Pr) 
such that the rank of the corresponding p' in Q r (X) increases, and dim(g) for any 
other element in <2r(X) does not decrease. 
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Proof. We can assume that p is minimal in Qr(X) among the submatrices that are 
rank-deficient. It suffices to show that we can increase the rank of this submatrix by 
one. There are two cases to consider. 

Case 1: There is a column x of p that does not belong to any child of p and is a 
linear combination of the rest of the columns of p (for instance this happens when p 
has at most one child). In this case we choose a vector x G Per(p) \ span(p) which 
is guaranteed to exist by Lemma f4. 181 Then adding an infinitesimal multiple of x to 
x increases dim(p) without increasing the rank of any of the matrices in Ai(p), and 
hence does not change the fact that X satisfies the minors of Pr. 

Case 2: Our element p has a left child p\ and a right child P2, but none of the 
columns of p that are not in pi or p 2 can be written as a linear combination of the 
rest of the columns of p. We cannot add a vector x G Per(p) to any part of p which 
will increase dim(p) without risking the increase of dim(pi) or dim(j>2)- We let q be 
the common child of p\ and p 2 , and if there is no such child we let q — 0. Now 
there exists a column x of p\ that is not in span(g). This is clear when q — 0, and 
otherwise this follows from the minimality assumption on p. Now we choose a vector 
x G Per(p) \ span(p) which is almost parallel to x, and we assume that both vectors 
have the same norm. We let B be a basis of C m that contains the columns of q (the 
columns of q are linearly independent since q is in at least the third row of Qr(X)) as 
well as a basis for span(pi), and in particular x. This implies that B = B \ {x} U {x} 
is also a basis for C m . We let T be the change of basis matrix from B to B. Now 
assuming that P2 = [a, b], we perturb X and obtain 

X = [ T-X[l,o- 1] | X[a,n] ] . 

Since x is almost parallel to x and both vectors have the same norm, the linear 
transformation T is small in the sense that it is close to the identity matrix in the 
Euclidean topology. Furthermore, this perturbation increases dim(p) by one, and any 
submatrix q G M. (p) will not change its rank. The rank of any submatrix q > p with 
q ^ M.(p) increases by at most one, and hence dim(g) < D(q) after perturbing by T. 
Finally, a submatrix which does not contain p is either unchanged or is changed by 
applying an element of GL m (C) which does not alter the rank. This implies that our 
new perturbed matrix is in V(Pr) and completes the proof that we can always make 
perturbations to improve the ranks of rank-deficient submatrices. □ 

Theorem 4.20 Let K be a field of characteristic zero. Then Pp is a prime ideal. 

Proof. First suppose that K — C. Corollary 14.51 savs that Pr is radical and Propo- 
sitions E^^J and imply that V(Pr) is irreducible, hence Pp is prime by the Null- 
stellensatz. Now we apply the Lefschetz principle to deduce that Pr is prime over an 
arbitrary field K of characteristic zero. For this suppose there are f,g in i^[xy] with 
fg G P r but f,g £ Pr- Then fg G P r but f,g £ P r over the field Q{{c a }) where {c a } 
is the finite set of coefficients of / and g. Since C has infinite transcendence degree 
over Q and is algebraically closed, and these fields have characteristic zero, Q({c a }) 
can be embedded as a subfield of C. The images of / and g under this embedding 
will show that Pr is not prime over C. This is a contradiction. □ 
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5 Higher Dimensional Adjacent Minors 

Let m = (mi, . . . rrid) G Z d with all rrij > 2 and let X m be the generic d-dimensional 
rrii x • • • x nid matrix with entries Xi u ... t i d . Throughout this section we will call any 
integer vector u = (u\, . . . , Ud) even if Yl u j ls even, and odd otherwise. 

Definition 5.1 Let i = [i\, . . . , id) be an integer vector with 1 < ij < mj — 1 for all 
j. A multidimensional adjacent 2-minor is a binomial of degree 2 d ~ 1 of the form 

{ [ ''i • < " \ { X i • < • 

e£{0,l} d ££{0,l} d 
e even e odd 

Furthermore we let 7 m (2) be the ideal in K[x{\ generated by all the multidimensional 
adjacent 2-minor s. 

The ideal / m (2) generalizes the ideals I mn (2) of 2 x 2 adjacent minors from Section 
2. The set of vectors {u — v : x u — x. v is an adjacent 2-minor} is a basis for the lattice 
of ^-dimensional mi x ■ • ■ x integral matrices with all line sums equal to zero |12j . 
A line sum of a matrix with entries u\ is any sum of the form 




This is actually only a very special case of the types of marginals which one may 
compute of multidimensional matrices. In fact, any marginal computation of a multi- 
dimensional matrix leads naturally to a lattice of integer matrices with all marginals 
equal to zero. From this lattice, we can extract a lattice basis of generalized adja- 
cent minors |12j . and construct an ideal of generalized adjacent minors. The general 
results on lattice basis ideals in ^T] imply that every minimal prime of these ideals 
of adjacent minors is of the form in equation (0), so we only need to determine the 
variables which appear in each minimal prime. 

The similarity between the 2x2 adjacent minors for two dimensional matrices 
and the higher dimensional adjacent minors we describe in this section is somewhat 
misleading. One important difference is that the higher dimensional minors do not 
describe rank conditions on tensors, so the linear algebra arguments which we applied 
in Sections 3 and 4 no longer succeed. This problem aside, one might still hope that 
the partition of variables which arises in the description of the minimal primes of 
ideals of higher dimensional adjacent minors might still provide a decomposition of 
the multidimensional matrix into rectangular chambers and their boundaries. Unfor- 
tunately, this hope is far from the true description of the minimal primes. In this 
section, we describe the minimal primes in a few special instances, showcasing the 
increasing complexity which arises in higher dimensions. 

Example 5.2 Let d = 3 and m = (2,2,3). The ideal of multidimensional adjacent 
minors is 
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^2,2,3(2) — ( ^111^122^212^221 ~ ^112^121^211^222; ^112^123^213^222 ~ ^113^122^212^223)- 



If we choose a term order which selects the underlined terms as the leading terms, 
these leading monomials are relatively prime, and hence this ideal is a radical complete 
intersection. The five minimal primes of £2,2,3(2) are the ideals 

(Xn2, X122), (^222, :£l22), (^222, #212), (#212) #112/) 

and 

£2,2,3(2) : (J^^Tc) 00 = £2,2,3(2) + (xinXi23a:213a;221 - £113X121X211X223)- 

□ 

Generalizing Example 15.21 it is possible to give a combinatorial description of 
the minimal primes of the ideal of multidimensional adjacent minors whenever m = 
(2, 2, ... , 2, m). This is the content of the following theorem. 

Theorem 5.3 Let J m (2) be the ideal of adjacent 2-minors where m = (2, 2, . . . , 2, m) . 
The minimal primes of I m (2) are of the form as in (QJ) where the set S of variables 
is a collection of the pairs of variables x slt _ jSd _ 1; j. and x tl> ... t t A _ 1) j i chosen for each ji 
in the (possibly empty) set J = {2 < j 1 < ■ ■ ■ < ji < m — l\ji + 1 < j, t+ i} such that 
the index vector of the first variable is even and the second one is odd. Moreover, if 
we let fd{fn) denote the number of minimal primes of this ideal, then the function fd 
satisfies the recurrence relation 

f d (m + l) = f d (m)+4 d ~ 2 f d (m-l), 
with initial conditions /d(l) = f d {2) — 1- 

Proof. Note that if P is a minimal prime of J m (2) and contains a variable x Slv .. jSd li j 4 
whose index set is even (or odd) then it must contain some other variable Xt u ... } t d _- i _,j i 
whose index set is odd (or respectively even). Moreover P cannot contain another 
variable with last index ji because this would contradict the minimality of P. Any 
adjacent 2-minor that contains these two variables must contain two other variables 
of opposite parity with either a last index ji + 1 or ji — 1, therefore no variable of this 
form is needed in P. A similar reasoning implies that the variables with last index 1 
or m do not appear in P either. This shows that every minimal prime has the desired 
form. To see that every ideal of the form we described is a minimal prime one needs 
merely note that there are no containment relations between these ideals. 

Now we prove the recurrence relation. Let P be a minimal prime arising from the 
sequence J. If the last index in J is not equal to m — 1, then the sequence J and the 
choice of variables provides a minimal prime for J m (2) where m = (2, 2, . . . , 2, m — 1). 
If the last index in J is equal to m — 1, then removing it from the sequence (and 
the corresponding variables from S) produces a minimal prime Q for J m (2) where 
m = (2,2, ... ,2, m — 2). There are precisely 4 d ~ 2 minimal primes P that would give 
rise to Q since there are 4 d ~ 2 possible pairs of variables with last index m — 1 and 
having opposite parity. □ 
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Aside from Theorem 15.31 we do not know of any general characterization of the 
minimal primes of these ideals of higher dimensional adjacent minors. We conclude 
this section with an example which shows that these minimal primes do not have 
the same appearance as in the two dimensional case, where the partition of variables 
corresponded to rectangular subregions and their boundaries. 

Example 5.4 Let m = (3, 3, 3). Then there are sixty-seven minimal primes of J m (2) 
which fall into nine symmetry classes modulo the natural symmetry of the cube. In 
the table below, we display the set of variables S which appear in the representative 
minimal primes, as well as the number of minimal primes in a given symmetry class, 
and the degree of the corresponding prime ideal. 



s 


size 


degree 





1 


2457 


{X22I, %222, ^223} 


3 


1 


{Xi21, X122, ^123} 


12 


81 


{^121, X122, ^123, ^223, ^323} 


12 


12 


121; £122, ^123) £232, 2:332} 


12 


12 


{^122? ^322; ^211) ^213? ^231? ^233} 


3 


1 


{^121) ^122) 3?123j £321, ^322, X323} 


6 


1 


{ 3^121) #122) £123, £312, ^322, £332} 


6 


1 


{£121) #123, ^232, ^332, ^212, X312} 


12 


1 



□ 

References 

[1] S. Aoki and A. Takemura. Minimal basis for connected markov chain over 
3x3xk contingency tables with fixed two-dimensional marginals. Technical re- 
port, METR 02-02, 2002. 

[2] W. Bruns and J. Herzog. Cohen-Macaulay rings, volume 39 of Cambridge Studies 
in Advanced Mathematics. Cambridge University Press, Cambridge, 1993. 

[3] W. Bruns and U. Vetter. Determinantal rings, volume 1327 of Lecture Notes in 
Mathematics. Springer, Heidelberg, 1988. 

[4] A. Conca. Grobner Bases and Determinantal Rings. PhD thesis, Universtat 
Essen, 1993. 

[5] A. Conca. Ladder determinantal rings. Journal of Pure and Applied Algebra, 
98:119-134, 1995. 

[6] C. De Concini, D. Eisenbud, and C. Procesi. Young diagrams and determinantal 
varieties. Invent. Math., 56:129-165, 1980. 



27 



[7] P. Diaconis, D. Eisenbud, and B. Sturmfels. Lattice walks and primary decom- 
position. In Mathematical essays in honor of Gian- Carlo Rota (Cambridge, MA, 
1996), pages 173-193. Birkhauser Boston, Boston, MA, 1998. 

[8] S.E. Fienberg. The Analysis of Cross- Classified Categorical Data. The MIT 
Press, Cambridge, MA, 2nd edition, 1980. 

[9] N. Gonciulea and C. Miller. Mixed ladder determinantal varieties. Journal of 
Algebra, 231:104-137, 2000. 

[10] M. Hochster and J. A. Eagon. Cohen-Macaulay rings, invariant theory, and the 
generic perfection of determinantal loci. Amer. J. Math., 93:1020-1058, 1971. 

[11] S. Ho§ten and J. Shapiro. Primary decomposition of lattice basis ideals. Journal 
of Symbolic Computation, 29(4-5):625-639, 2000. 

[12] S. Ho§ten and S. Sullivant. Grobner bases and polyhedral geometry of reducible 
and cyclic models. Journal of Combinatorial Theory, Series A, 100:277-301, 
2002. 

[13] G. Pistone, E. Riccomagno, and H.P. Wynn. Algebraic Statistics: Computational 
Commutative Algebra in Statistics. Chapman & Hall, 2001. 

[14] B. Sturmfels. Grobner bases and Stanley decompositions of determinantal rings. 
Mathematische Zeitschrift, 205:137-144, 1990. 

[15] B. Sturmfels. Solving Systems of Polynomial Equations, volume 97 of CBMS 
Regional Conference Series in Mathematics. American Mathematical Society, 
Providence, RI, 2002. 

[16] T. Svanes. Coherent cohomology on Schubert subschemes of flag schemes and 
applications. Advances in Mathematics, 14:369-453, 1974. 



28 



